BI. .CHNOLOGY 
SYSTEMS 
BRANCH 



RAW SEQUENCE LISTING 
ERROR REPORT 
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CENTER 1600/2900 



■The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following compiuter readable 



form: 



Application Serial Number: 



Source: 
Date Processed by STIC: 



01 



THE ATTACMEB PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOK C.itiii' SlJiiivilSSION QUESTIONS, FLEASE CONIACTBrLlRKSFEKCER, 703^38S4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: Datan21hclp@aflspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: paitaini3help@MSPtOogov or phone 703-306-4119 (R Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3,0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Versiom 3.0 
The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance witii format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25 . 

Checker Version 3,0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

IlTittp://www.Mspto«gov/web/offices/pac/cIhiecker 
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RAW SEQUENCE LISTING DATE: 07/26/2001 

PATENT APPLICATION: US/09/441,242 TIME: 15:22:26 

Input Set : A: \8666-008seqlist: . txt 
Output Set: N:\CRF3\07262001\I441242.raw 

163 <210> SEQ ID NO: 4 

164 <211> LENGTH: 108 

165 <212> TYPE: PRT 

166 <213> ORGANISM: Artificial 

168 <220> FEATURE: 

169 <223> OTHER INFORMATION: MTCPl protein 

171 <220> FEATURE: 

172 <221> NAME/KEY: site 

173 <222> LOCATION: (1)..(108) 

174 <223> OTHER INFORMATION: Xaa - any amino acid 
177 <400> SEQUENCE: 4 

17 9 Met Ala Gly Glu Asp Val Gly Ala Pro Pro Asp His Leu Trp Val His 
180 15 10 15 

182 Gin Glu Gly lie Tyr Arg Asp Glu Tyr Gin Arg Thr Trp Val Ala Val 

183 20 25 30 

185 Val Glu Glu Glu Thr Ser Phe Leu Arg Ala Arg Val Gin Gin lie Gin 

186 35 40 45 

188 Val Pro Leu Gly Asp Ala Ala Arg Pro Ser His Leu Leu Thr Ser Gin 

189 50 55 60 

191 Leu Pro Leu Met Trp Gin Leu Tyr Pro Glu Glu Arg Tyr Met Asp Asn 

192 65 70 . 75 80 

194 Asn Ser Arg Leu Trp Gin lie Gin His His Leu Met Val Arg Gly Val 

195 85 90 95 
^ 197 Gin Glu Leu Leu Leu Lys Leu Leu Pro Asp Asp Xaa 

198 100 105 

200 <210> SEQ ID NO: 5 

201 <211> LENGTH: 4922 

202 <212> TYPE: DNA 

203 <213> ORGANISM: Artificial 

205 <220> FEATURE: 

206 <223> OTHER INFORMATION: genomic DNA of TCL-1 

208 <220> FEATURE: 

209 <221> NAME/KEY: modif ied_base 

210 <222> LOCATION: (1)..(4922) 

211 <223> OTHER INFORMATION: m = a or c 

212 r = a or g 

213 w = a or t 

214 s = c or g 

217 <220> FEATURE: 

218 <221> NAME/KEY: modif ied_base 

219 <222> LOCATION: (1)..(4922) 

220 <223> OTHER INFORMATION: y = c or t 

221 k = g or t 

222 v=a,corg 

223 h = a, c or t 

226 <220> FEATURE: 

227 <221> NAME/KEY: modif ied_base 

228 <222> LOCATION: (1)..(4922) 

229 <223> OTHER INFORMATION: d = a, g or t 
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230 






231 






234 






235 






237 






239 






241 






243 






245 


w- 


-> 


247 






249 






251 






253 






255 


w- 


-> 


257 






259 






261 






263 






265 






267 






269 






271 


w- 


-> 


273 


w- 


-> 


275 


w- 


-> 


277 


w- 


-> 


279 






281 






283 






285 






287 






289 






291 






293 






295 






297 






299 






301 






303 


w- 


-> 


305 


w- 


-> 


307 


w- 


-> 


309 






311 






313 






315 


w- 


-> 


317 


w- 


-> 


319 


w- 


-> 


321 






323 






325 



RAW SEQUENCE LISTING DATE: 07/26/2001 

PATENT APPLICATION: US/09/441,242 TIME: 15:22:26 

Input Set : A:\8666-008seqlist.txt 
Output Set: N:\CRF3\07262001\I441242.raw 

b = c, g, or t 
n = a, c, g, or t 
<400> SEQUENCE: 5 

gtcgacttgt gaktyccmag magaggccca gaagtcccgg tccggcaaag cggaggggaa 60 

9<^99999999 tcttccaaga agaagaaagg gcccaaggtt caacccccgg tgccttctcc 120 

agcagcaagc aagagggcgg cgggtcggtt gtcgctgctg gccggggccc tccgaggaaa 180 

ggcgcggrcc agctggggcc gggtctgcgt tcccaggagc tgccaccgtt ccagggagca 240 

agtcaggccg ggacgttagc gcctgcgcgg gaccctcact tgccaccaag rmccccacaa 300 

accccgcccc atcctgyctt acgccccgcc ccaaggtcgg ttctccccga cccgggggtc 360 

ccgcccccaa ggnccgtcct ccccgccccc gccgsttggt ggcggccgca tgctgcccgg 420 

atataaaggg tcggccccac atcccaggga ccagcgagcg gccttgagag gctctggctc 480 

ttgcttctta ggcggcccga ggacgccatg gccgagtgcc cgacactcgg ggaggcagtc 540 

accgaccacc cggaccgcct gtgggcctgg gagaagttcg tgtatttgga cgagaagcag 600 

macgcctgcc tgcccttaac catcgaggta caaccacctt tggagcggat ggcgargcag 660 

caggggcasc ccctgggagc ttgggatncc ctaggaaggg cgaggactca aggagcactc 720 

actatggggc agggaggatc ccccacagat kaagccactt ttggagccgg sctctkgagg 780 

gatgaatagg agttcctcca ggcagggaag aagggtggga aaaccccaaa ggaatgtcgg 840 

tcaaaggggt ggacccagtg cctgtggagt gtgactataa tgttgactac agcaggcatt 900 

ttctgggctt cggggtccta atccttaaaa atgggtatct ctaagtgact catccatatg 960 

gccgattatc ggaatcatct caggtgggtc ccagaaatct gtatttttaa aaagaacccw 1020 

cmacagttta gggtccaacc caggcataac caaaacactg gcctaagagt tgtgaagtat 1080 

tttcccacct accctctggg ctttatttaa gamaaccaaa tttaacaagt gatgtcgtag 1140 

tataagcgcc ggtantkgaa ycaatattga cttttttaat gtgtgatgcc ttaagatggg 1200 

tccttaatcc atgttaagnt tttgttaaag aaatagataa gtcttttaca agtatttgga 1260 

tttactcaat gaaaaagagt canaaaatgt tcaaactctc tccaaacata cactgaagaa 1320 

agcataaaaa ttaimaaata tattagaaca cgtatgtcca gtagcaawca maaattattg 13 80 

agtgttgayt gtgtctctac agatgggaaa ctgaggcaca cmaaatgtac atttgtccga 1440 

ggtaagattg ctagtaggta atggggttgg aattctaggc tcttaaccac cacaaaatct 1500 

gcatttttat tggcatttca attttttaaa tatgttttta ctttaaaaat caagttaaat 1560 

acttactttt ttaaaatcaa aatttgaaga aataatttga agattcagtg gatttctttt 1620 

tttaaatctc tgagaaatct cttccctyca acgtgacacc maaaccmgcg aaccagacag 1680 

tttttcataa aatcatgaaa catgcyccmc maaaaataac ccactascaa actgtgggac 1740 

agattttgcc tcacatcatt gaaaaggcca gcawtctttt tctctctttc tttctttgkt 1800 

gttttttttt tttcctgtag awacagggtc tcgctctgtg acccaggctg gtctyaaact 1860 

cctggcctca agcgatcctc ctgcctctgc cttccaaagc actggaatta caagtgtgag 1920 

ccgctgcaac ccgccagaaa aaagtgtgcc tttcatggcc ctgtctgggt ggctagacac 1980 

gtgtgtgtgc tggtggtcct ggcccagcca gagttccctg agaggagcat gcatggccta 2040 

aggaagtgag cttcagggaa cagtgatgac catcatttca cactcggacc ccctgccmaa 2100 

gatgggtgga tgsctgscag ggagggattc cggtkttcct gcgcctggag aanccctgcc 2160 

aagcggaacc tgaaagtatn ccctgtcctt ttcttctcct nagataaagg ataggttaca 2220 

gttnngggtg ctcttgcgtc gggaagacgt cgtcctgggg aggcctatga cccccaccna 2280 

gataggccca agcctgctgc ctatcatgtg gcagctctac cctgatggac gataccgatc 2 340 

ctcagactcc agtttctggc gcttagtgta ccacatcaag gtgagtgtct ttctcccaga 2400 

ggtccatcgg ktgatcttgg gtttcccctc cycmatgtct gsccttagtg gtttaycttc 2460 

ccyccatccc agtssgcaaa agcattwaaa aratggggga nrtrwacmas tgcagatttc 2520 

tanaggactt taccagagag aaganagatc ctntgaggtc tctaanagaa ccctacctcc 2580 

acttcctccc anccaccanc taaccgcagg aagacatctc tggtggggmm kcacaggctg 2640 

aaggctggtg ggaggagggr caktctccaa gascccctga aatcctcaca cctgggttcc 2700 

tacctgctgt ttccagctag gggaagscsc aggagtgagg aatggaggga gtggagggct 2760 
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RAW SEQUENCE LISTING DATE: 07/26/2001 

PATENT APPLICATION: US/09/441,24 2 TIME: 15:22:26 

Input Set : A:\8666-008seqlist.txt 
Output Set: N:\CRF3\07262001\I441242.raw 

327 ctggccgatc aatgccttct ctctctctct gcctctcaga ttgacggcgt ggaggacatg 2820 

329 cttctcgagc tgctgccaga tgactgatgt atggtgagct ccactggagc ctgacccctc 2880 

331 ttagtccaca gtggctgtat cagaaagaaa gaccacccct tctccatgaa ggcagtgcta 2940 

333 acccctcccc gactgctgcc atctgagggt ccctagggat gggagaggct tcctggaggc 3000 

335 actcatgtct cccttaccac ttcgggagcc aagggctttg gtaggcagcc ccctttatcg 3060 

337 cagctgctca tatctataaa gtacttcaca agtttcagct ggcactttca ttttaccatt 3120 

339 gctttttttt tctttgggag atgagtctgg ctctgtggcc caggctagag tgtagtgggt 3180 

341 gcaatctcag ctcactgaaa gctctgcctc ccgggttcac accattctcc tgcctcagcc 3240 

343 ctcggagtag ctgggactac aggcgcccgc caccacacct ggctaatttt tttttttttw 3300 

345 ttwtwttttt tagtagagmc ggggtttcac cgtgttagcc aggatggtct cgatctcctg 3360 

347 acctcatgat ctgcccgcct cggcctccca aagtgctggg attacaggca tgagccacca 3420 

34 9 cgtccggcct taccattgct ttattaaata agcactggtg cttgattata tcagctgagc 34 80 

351 cagatattag atacgctatt gagttttgrg gaaataagag taccaaaact cagaaatgag 3540 

353 ttgaagtata gtgacatctt cagattacag acccaggtgt cagaatttgc cttggctcag 3600 

355 aaggcctctg ggggccatcc ctgaccacta ggcttcccac ttagacctgc tccagcagca 3660 

357 ccacccctcg scactgcctg gtcctttcct tcacccttga ttctgtcttc ttttgtcctt 3720 

359 ctccaggtct tggyagcacc tgtctccttt caccccaggg cctgagcctg gccagcctac 3780 

361 aatggggatg ttgtgtttct gttcaccttc gtttactatg bctgtgtctt ctccaccacg 3840 

363 ctggggtctg ggaggaatgg acagacagag gatgagctct acccrgggcc tgsaggacct 3900 

W--> 365 gtcctgtagm ccactctgct cgccttagsa cctacsactc cwrccgasga ggatnccant 3960 

W--> 367 tggaagagct tcttnnaggt gncnaanaan anctgtgcgt nggcttttct cagctggatg 4020 

W--> 369 atggtcntna gcctctttct gtcccttctg tccctcacag cactagtatt tnatgttgca 4080 

371 cacccactca gctccgtgaa tttgtgagaa cacaaccgat tcacctgagc aggacctctg 4140 

W--> 3 73 aaaccctgga ccagtggtct cacatggtgc tacgcctgca tgtaaacacg cctncaaacg 4200 

375 ctgcctgcck gtraacacgm sksyrmacag stgmswrccc gtaaacacgc ctgcaaacgc 4260 

377 tgcctgccca cacaggttca cgtgcagctc aaggaaagrm ctgaaarrag cccttatctg 4320 

'79 tgctcaggac tcagaagcct ctgggtcagt ggtccacatc ccgggacgca gnaggaggcc 4380 

31 aggccggcga gccctgtgga tgagccctca gaacccttgg gttgcccacg tggaaaaggg 4440 

383 atagaggttg ggtttccccc cttttataga tggtcacgca cctgggtgtt acaaagttgt 4 500 

-> 385 atgtggcatg aatacttgnt gtnatgattg attaaatgca agatagttta tctaacttcg 4 560 

-> 3 87 tgcggaatca gcttctatcc ttgncttaga ttctggtgga gagaagtgan aataggcagn 4 620 

W--> 389 ccccanataa anaatattca ngggatttat tttattnttc cttttgggng atnngggact 4 680 

W--> 391 acattntncn nccccgtnta atccaatgnt taaancccca gtgttcttgg aggncntacg 4740 

W--> 393 tcgaanacca ttggngtang caacctcaaa atttttnngt tgnnaattnc cngttttcca 4800 

W--> 395 gagncccccc cntnctccat cttnntcctn gcccnccctn ncctcccnca ntcccnangt 4 860 

W--> 397 tnccctcgnc cccagtcagt tctttctccn nctttanccg ntnatntcac cagnttcttt 4 920 

399 ct 4922 

402 <210> SEQ ID NO: 6 

403 <211> LENGTH: 20 

404 <212> TYPE: DNA 

^> 405 <213> ORGANISM: Artificial 

407 <220> FEATURE: 

408 <223> OTHER INFORMATION: p9A primer 

410 <400> SEQUENCE: 6 

411 tgctgccaga tgactgatgt 20 

414 <210> SEQ ID NO: 7 

415 <211> LENGTH: 20 

416 <212> TYPE: DNA 

417 <213> ORGANISM: Artificial 

Use of n and/or Xaa has been detected in the Sequence Listing. 
Review the Sequence Listing to insure a corresponding 
r u.i _J) explanation is presented in the <220> to <223> fields 
^ ^ each sequence using n or Xaa. 
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VERIFICATION SUMMARY DATE: 07/26/2001 

PATENT APPLICATION: US/09/441,242 TIME: 15:22:27 

Input Set : A:\8666-008seqlist.t2t 
Output Set: N:\CRF3\07262001\I441242.raw 

L:19 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:1 
L:97 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:2 
L:99 M:258 W: Mandatory Feature missing, <220> FEATURE: 
L:99 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION: 

L:136 M:220 C:. Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 3 
L:166 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:4 
L:197 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 

L:203 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 5 



L 


:247 


M 


:341 


W: 


(46) 




or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:5 


L 


:257 


M 


:341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:5 


L 


:273 


M 


:341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


: 5 


L 


:275 


M 


:341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:5 


L 


:277 


M 


:341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


:5 


L 


:279 


M 


:341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:5 


L 


:305 


M 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


:5 


L 


:307 


M 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:5 


L 


309 


M 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:5 


L 


317 


M 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:5 


L 


319 


M 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:5 


L 


321 


M 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


:5 


L 


365 


M 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


5 


L: 


367 


M 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


5 


L: 


369 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


5 


L: 


373 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


5 


L: 


379 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


5 


L: 


385 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


5 


L: 


387 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID#: 


5 


L: 


389 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID#: 


5 


L: 


391 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID#: 


5 


L: 


393 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID#: 


5 


L: 


395 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID#: 


5 


L: 


397 


M: 


341 


W: 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# : 


5 



L:405 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 6 

L:417 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 7 

L:429 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 8 

L:441 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 9 

L:453 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:10 

L:465 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:11 

L:477 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:12 

L:511 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 12 
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Raw Sequence Lbtlng Error Summary 



NOV I 3 2O01 
TECH CENTER (600/2900 



ERROR DETECTED 
ATTN: NEW RULES CASES: 



1 



Wrapped Nuclcics 
Wrapped Aminos 



SUGGESTED CORRECTION SERIAL NUMBER: 07/ TT (^ ^ WlL 

PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The number/text at the end of each line "wrapped** down to the next line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



2_ 
3 



Invalid Line Length The rules require that a line not exceed 72 characters in length. *fhis includes white spaces. 



Misaligned Amino 
Numbering 

Non-ASCII 



Variable Length 



_PatcntIn 2.0 
"bug*' 



7 Skipped Sequences 

(OLD RULES) 



(NEW RULES) 



_Use of n*s or Xaa*s 
(NEW RULES) 



10 



J 

11 ^ 1 



Invalid <2I3> 
(Response 

Use of<220> 



12 



_Patenthi 2.0 
"bug** 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission b saved in ASCII text. 

\ ■ > V ' 

Sequence(s) contain n*s or Xaa*s representing more thah one rissidue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug** in Patcntin version 2.0 has c^ed <220>-<223>isection to be missmg from amino acid 

sequences(s) . Normally, Patenlln would' automatically, generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Arttnclal or Unknown sequences. 



Sequence(s) _ 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X** is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION :SEQ ID NO:X: (insert SEQ ID NO where "X** is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:** response to Include the skipped sequences. 



8 Skipped Sequences ' Sequence(s) 



missing. If Intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n*s and/or Xaa*s have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n*s or Xaa*s are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Sequence(s) 



_ missing the <220> "Feature** and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism** responise is "Artificial Sequence** or 
"Unknown.** Please explain source of genetic materia! in <220> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63. No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk*' function of Patentin version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager** or any other manual means to copy file to floppy disk. 



1 3 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 



AMC/MH - Biotechnology Systems Branch - 08/21/2001 



Page 1 of 7 



3 
5 
7 
9 
10 
12 
14 
16 
17 
18 
19 
21 
22 
24 
25 
26 
28 
29 
30 
31 
33 
34 
35 
37 
38 
39 
41 
42 
43 
45 
46 
47 
49 
50 
51 
53 
54 
55 
57 
58 
59 
61 
63 
65 
67 
69 
71 



OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/441,242 



DATE: 07/26/2001 
TIME: 15:22:26 



Input Set : A:\8666-008seqlist, txt 
Output Set: N:\CRF3\07262001\I441242.raw 



Does Wot Comply 
corrected Diskette Needed 



<110> 
<120> 
<130> 
<140> 
<141> 
<160> 
<170> 
<210> 
<211> 
<212> 
<213> 
<220> 
<223> 
<220> 
<221> 
<222> 
<400> 



APPLICANT: Russo, Giandomenico et al. 

TITLE OF INVENTION: TCl-1 Gene and Protein and Related Methods and Compositions 
FILE REFERENCE: 8666-008 

CURRENT APPLICATION NUMBER: 09/441,242 

CURRENT FILING DATE: 1999-11-16 

NUMBER OF SEQ ID NOS : 12 

SOFTWARE: Patentin version 3.0 

SEQ ID NO: 1 

LENGTH: 1324 

TYPE: DNA 

ORGANISM: Artificial 

FEATURE : 

OTHER INFORMATION: cDNA sequence of TCL-1 
FEATURE : 



NAME/KEY: CDS 
LOCATION: (49) . . (387) 
SEQUENCE: 1 

cttgagaggc tctggctctt gcttcttagg cggcccgagg acgccatg gcc gag tgc 

Ala Glu Cys 



57 



ccg 


aca 


etc 


ggg 


gag 


gea 


Pro 


Thr 
5 


Leu 


Gly 


Glu 


Ala 


tgg 


gag 


aag 


tte 


gtg 


tat 


Trp 


Glu 


Lys 


Phe 


Val 


Tyr 


20 










25 


tta 


acc 


ate 


gag 


ata 


aag 


Leu 


Thr 


lie 


Glu 


He 


Lys 










40 




egg 


gaa 


gac 


gtc 


gtc 


ctg 


Arg 


Glu 


Asp 


Val 


Val 


Leu 








55 






cca 


age 


ctg 


ctg 


ect 


ate 


Pro 


Ser 


Leu 


Leu 


Pro 


He 






70 








cga 


tec 


tea 


gac 


tec 


agt 


Arg 


Ser 


Ser 


Asp 


Ser 


Ser 



Val 
10 
ttg 
Leu 



Thr Asp His 



gac 
Asp 



gag 
Glu 



aag 
Lys 



gat agg tta eag 



ggg agg ect 



60 



45 
atg 



eag 
Gin 
30 
tta 



acc 



gac 



85 
ggc 



gtg gag gac atg 



75 

ttc tgg cgc tta gtg 
Phe Trp Arg Leu Val 
90 

ctt etc gag ctg ctg 



Asp Gly Val Glu Asp Met Leu Leu Glu Leu Leu 







1 








gac 


cgc 


ctg 


tgg 


gee 


105 


Asp 


Arg 


Leu 


Trp 


Ala 




15 












cac 


gee 


tgg 


ctg 


cec 


153 


His 


Ala 


Trp 


Leu 


Pro 
35 




egg 


gtg 


etc 


ttg 


egt 


201 


Arg 


Val 


Leu 


Leu 
50 


Arg 




cec 


ace 


eag 


ata 


ggc 


249 


Pro 


Thr 


Gin 
65 


He 


Gly 




ect 


gat 


gga 


cga 


tac 


297 


Pro 


Asp 
80 


Gly 


Arg 


Tyr 




tac 


cac 


ate 


aag 


att 


345 


Tyr 


His 


He 


Lys 


He 




95 












cca 


gat 


gac 






387 


Pro 


Asp 


Asp 









100 



105 



110 



tgatgtatgg tettggcagc acctgtctcc tttcacccea gggectgagc ctggccagcc 447 

tacaatgggg atgttgtgtt tctgtteacc ttegtttact atgeetgtgt ettetecaec 507 

acgetggggt ctgggaggaa tggacagaea gaggatgagc tetacccagg geetgeagga 567 

cctgectgta geccactctg etegcettag caetaccact cetgccaagg aggattccat 627 

ttggcagagc ttcttecagg tgeecagcta tacctgtgce tcggcttttc teagctggat 687 

gatggtctte agectcttte tgtcccttct gtccetcaca gcactagtat ttcatgttgc 747 
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RAW SEQUENCE LISTING DATE: 07/26/2001 

PATENT APPLICATION: US/09/441,24 2 TIME: 15:22:26 

Input Set : A:\8666-008seqlist.txt 
Output Set: N:\CRF3\07262001\I441242.raw 

73 acacccactc agctccgtga acttgtgaga acacagccga ttcacctgag caggacctct 807 
75 gaaaccctgg accagtggtc tcacatggtg ctacgcctgc atgtaaacac gcctgcaaac 867 
77 gctgcctgcc ggtaaacacg cctgcaaacg ctgcctgccc gtaaacacgc ctgcaaacgc 927 
79 tgcctgccca cacaggttca cgtgcagctc aaggaaaggc ctgaaaggag cccttatctg 987 
81 tgctcaggac tcagaagcct ctgggtcagt ggtccacatc ccgggacgca gcaggaggcc 1047 
83 aggccggcga gccctgtgga tgagccctca gaacccttgg cttgcccacg tggaaaaggg 1107 
85 atagaggttg ggtttccccc ctttatagat ggtcacgcac ctgggtgtta caaagttgta 1167 
87 tgtggcatga atactttttg taatgattga ttaaatgcaa gatagtttat ctaacttcgt 1227 
89 gcgcaatcag cttctatcct tgacttagat tctggtggag agaagtgaga ataggcagcc 1287 
91 cccaaataaa aaatattcat ggaaaaaaaa aaaaaaa 1324 

94 <210> SEQ ID NO: 2 

95 <211> LENGTH: 113 

96 <212> TYPE: 

97 <^JJ^ 
'^<220: 

,<223V OTHER INFORMATION: 

SEQUENCE: 2 



.> LENGTH: 113 

!> TYPE : PRT " ^ / I ^ Jh^ 

t> ORGANISM: C^tificial^ ^ ill P ^ A-tJLyy^-n^^^^L^^ ..A^^mJ^ 



101 


Ala 


Glu 


Cys 


Pro 


Thr 


Leu 


Gly 


Glu 


Ala 


Val 


Thr 


Asp 


His 


Pro 


Asp 


Arg 


102 


1 








5 










10 










15 




105 


Leu 


Trp 


Ala 


Trp 


Glu 


Lys 


Phe 


Val 


Tyr 


Leu 


Asp 


Glu 


Lys 


Gin 


His 


Ala 


106 








20 










25 










30 






109 


Trp 


Leu 


Pro 


Leu 


Thr 


He 


Glu 


He 


Lys 


Asp 


Arg 


Leu 


Gin 


Leu 


Arg 


Val 


110 






35 










40 










45 








113 


Leu 


Leu 


Arg 


Arg 


Glu 


Asp 


Val 


Val 


Leu 


Gly 


Arg 


Pro 


Met 


Thr 


Pro 


Thr 


114 




50 










55 










60 










117 


Gin 


He 


Gly 


Pro 


Ser 


Leu 


Leu 


Pro 


He 


Met 


Trp 


Gin 


Leu 


Tyr 


Pro 


Asp 


118 


65 










70 










75 










80 


121 


Gly 


Arg 


Tyr 


Arg 


Ser 


Ser 


Asp 


Ser 


Ser 


Phe 


Trp 


Arg 


Leu 


Val 


Tyr 


His 


122 










85 










90 










95 




125 


He 


Lys 


He 


Asp Gly Val 


Glu 


Asp 


Met 


Leu 


Leu 


Glu 


Leu 


Leu 


Pro 


Asp 


126 








100 










105 










110 






129 


Asp 

































133 <210> SEQ ID NO: 3 

134 <211> LENGTH: 560 

135 <212> TYPE: DNA 

Qt^> 136 <213> ORGANISM: Artificial 

138 <220> FEATURE: 

13 9 <22 3> OTHER INFORMATION: genomic sequence of TCL-1 

141 <400> SEQUENCE: 3 

142 gtcgactgtg agttcccagc agaggcccag agtcccggtc cggcagccga gggaagcggg 60 
144 ggggtcttcc agaagaagaa agggccaagg tcaccccggt gcctctccag cagcagcaga 120 
146 gggcggcggt cggtgtcgct gctggccggg gcctcgagga aggcgcgggc cagctggggc 180 
148 cgggtctgcg ttcccaggag ctgccaccgt tccagggagc aagtcaggcc gggacgttag 240 
150 cgcctgcgcg ggaccctcac ttgccaccaa ggaccccaca aaccccgccc catccttagc 300 
152 gcctgcgcgg gaccctcact tgccaccaag acccccacaa accccgcccc atcctgcctt 360 
154 acgccccgcc ccaaggtcgt tctcccgacc cggggtcccg ccccaagacc gtcctcccgc 420 
156 cccgccgctt ggtggcggcc gcatgctgcc cggatataaa gggtcggccc cacatcccag 480 
158 ggaccagcga gcggccttga gaggctctgg ctcttgcttc ttaggcggcc cgaggacgcc 540 
160 atggccgagt gcccgacact 560 
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